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ABSTRACT 

This  report  is  a  survey  of  Automated  Language  Processing  done  in 
1566.  It  is  limited  in  scope  to  analytical  processing  of 
natural  language,  excluding  work  in  programming  languages,  speech 
recognition,  and  statistical  processing  of  text.  It  focuses  on 
work  aimed  at  generating  and  analyzing  sentences  of  a  natural 
language.  This  survey  has  four  major  sections: 

The  first,  on  syntactic  theory,  contains  a  summary  of  the  prin¬ 
cipal  assumptions  underlying  work  in  generative  grammar  and  a 
report  of  the  most  significant  developments  in  theoretical  and 
descriptive  work  in  syntax.  The  second  section,  on  semantic 
theory,  attempts  to  provide  some  dimensions  along  which  we  can 
Judge  various  theories  that  have  been  proposed  and  developed 
in  the  literature  in  1966.  A  number  of  empir'cal  studies  re¬ 
lated  to  semantics  and  psycholinguistics  are  reported  in  a 
third  section.  Finally,  a  fourth  section  discusses  various 
computer  systems  for  manipulation  of  natural  language-  These 
range  from  systems  that  support  linguistic  studies  to  systems 
that  are  attempting  to  utilize  natural  language  as  a  communica¬ 
tion  medi"m. 
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INTRODUCTION 

The  area  of  automated  language  processing  covers,  in  its  broadest 

I 

sense,  any  use  of  computers  to  process  any  type  of  language. 

This,  obviously,  is  too  broad  a  field  to  cover  in  this  chapter, 
so  we  begin  by  delimiting  our  scope.  First,  only  the  processing 
of  natural  language  will  be  considered,  thus  excluding  work  on 
artificial  languages  such  as  programming  languages  and  command 
and  control  languages.  Furthermore,  we  are  interested  here  only 
with  analytical  (i.e.,  non  statistical)  processing  of  natural 
languages,  thereby  excluding  most  work  in  automatic  indexing, 
abstracting,  content  analysis  and  stylistic  analysis.  Finally, 
we  restrict  our  examination  to  analytic  processing  of  natural 
language  that  is  represented  in  its  input  or  output  form  in 
normal  orthography,  thus  excluding  work  in  speech  analysis  and 
synthesis,  and  optical  pattern  recognition.  Accordingly,  we  will 
focus  on  the  work  being  performed  to  generate  and  analyze  sen¬ 
tences  of  a  natural  language  (usually  English)  in  terms  of  some 
grammar,  or  data  base,  or  both,  and  then  doing  something  with 
this  analysis  -  perhaps  only  printing  it  out,  or  at  the  other 
extreme,  answering  questions  contained  in  the  input. 


I 

I 

I 


This  survey  has  four  major  sections.  The  first,  on  syntactic 
theory,  contains  a  summary  of  the  principal  assumptions  under¬ 
lying  work  in  generative  grammar  and  a  report  of  the  most  sig¬ 
nificant  developments  in  theoretical  and  descriptive  work  in 
syntax.  The  second  section,  on  semantic  theory,  attempts  to 


provide  some  dimensions  along  which  we  can  ,^udge  various 
theories  that  have  been  proposed  and  developed  in  the  literature 
in  i960,  A  number  of  empirical  studies  related  to  semantics  and 
psycholinguistics  are  reported  in  a  third  section.  Finally,  we 
conclude  with  a  discussion  of  various  computer  systems  for  mani¬ 
pulation  of  natural  language.  These  range  from  systems  that 
support  linguistic  studies  to  systems  that  are  attempting  10 
utilize  natural  language  as  a  communication  medium.  We  have,  in 
general,  considered  literature  from  1966  only,  since  Simmons 
survey  (106)  of  last  year  covered  material  through  1965. 


SECTION  II 


SYNTACTIC  THEORY 

The  linguistic  framework  within  which  almost  all  of  the  current 
wor  in  language  processing  (as  delimited  earlier)  is  carried  out 
involves  the  theory  of  language  developed  by  Chomsky  and  others 
that  was  introduced  in  Syntactic  Structure  (17),  and  elaborated 
on  in  Aspects  of  the  Tneory  of  Syntax  (1*0.  A  very  lucid  pre¬ 
sentation  of  this  material  can  be  found  in  "Computer  Analysis 
of  Natural  Languages"  by  Susurnu  Kuno  (68).  However,  let  us 
review  quickly  the  assumptions  underlying  work  in  generative 
grammars  of  this  type.  First,  the  goal  of  linguists  in  writing 
grammars  is  to  state  clearly  and  concisely  those  facts  that  a 
native  speaker  of  the  language  knows  about  his  language.  This 
knowledge  has  been  called  the  competence  of  the  speaker  of  a 
language;  thus  a  grammar  may  be  thought  of  as  a  model  for  the 
ideal  speaker's  competence.  Such  knowledge  of  the  language  is, 
to  be  sure,  not  always  obvious.  For  example,  a  speaker  of 
English  can  provide  the  correct  plural  ending  on  any  noun,  even 
if  it  is  a  nonsense  word;  few  speakers  recognize,  however,  that 
the  different  endings  they  systematically  evoke  are  directly 
correlated  with  the  final  sound  of  the  word,  a  regularity 
statable  in  three  simple  phonological  rules.  Furthermore,  many 
of  the  regularities  of  a  language  can  be  captured  only  by 
abstract  representations  of  what  are  eventually  the  surface  facts. 


But  to  state  such  regularities  of  a  language  is,  by  itself,  not 
the  sole  end.  A  more  far-reaching  goal  consists  of  finding  uni¬ 
versal  features  of  all  languages,  from  Chinese  to  English  to 
Sanscrit,  and  of  determining  a  maximally  simple  specification  of 
those  universal  facts,  along  with  the  idiosyncratic  features  of 
particular  languages.  Underlying  this  approach  is,  obviously, 
the  assumption  that  all  natural  languages  share  a  large  common 
area  of  similarity,  albeit  at  a  rather  abstract,  level.  Further¬ 
more,  although  there  is  some  disagreement  on  this  point,  it  is 
generally  assumed  that  there  does  not  exist  at  present  any 
discovery  procedure  for  determining  the  most  insightful  linguistic 
analysis  for  a  particular  part  of  some  given  language  in  terms  of 
a  particular  linguistic  theory.  Nor,  it  appears,  is  there  any 
reason  to  assume  one  will  be  forthcoming  in  the  near  future. 

A  transformational  grammar  is  intended  to  be  neither  a  psycho¬ 
logical  model  for  the  user  of  a  language  nor  a  pedagogical  tool. 
The  speaker's  competence  in  a  language,  those  facts  about  the 
language  that  a  grammar  is  intended  to  capture,  is  not  always- 
reflected  in  his  actual  performance ,  for  instance,  while  he  is 
being  affected  by  noise,  interruptions,  indigestion,  and  the  like. 
A  theory  of  performance  is  an  important  area  of  study,  but  it  is 
not  currently  being  considered  by  most  linguists.  They  feel  that 
the  restrictions  that  are  necessarily  placed  on  the  existing 
language  processing  systems  are  completely  ad  hoc .  Consequently, 
although  most  grammars  that  have  been  used  in  language  proc  sslng 
are  generative  in  nature,  beginning  with  the  syntactic  portion  of 
a  sentence  first  and  then  accounting  for  the  semantic  and  phono¬ 
logical  portions,  this  should  be  not  held  as  a  model  for  actually 
producing  speech  nor  for  its  interpretation  by  the  hearer.  Nor 
should  any  rules  in  any  grammar  to  date  be  thought  of  as  analogous 
to  the  accual  neurophysiological  manipulations  that  occur. 
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Hopefully,  the  relationships  that  the  rules  of  a  grammar  depict 
have  some  psychological  relevance,  bu^  at  present  this  is  only 
speculation. 
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A  transformational  grammar  consists  of  three  major  rule  components: 
a  syntactic  component,  a  semantic  component,  and  a  phonolcgical 
component.  The  syntactic  component  is  the  only  creative  component 
of  the  transformational  grammar.  It  contains  a  base  subcomponent 
(a  context-free  phrase  structure  grammar),  a  transformational 
subcomponent,  and  a  lexicon.  The  structures  produced  by  the  base 
component,  with  lexical  substitutions,  ar<^  designed  to  represent 
all  of  the  semantic  relationships  of  the  sentence  being  generated. 
The  semantic  component,  applied  to  these  abstract  constructs 
generated  by  rules  of  the  base  subcomponent,  produces  a  semantic 
interpretation  for  a  sentence.  This  implies  that  many  para¬ 
phrases  having  disparate  syntactic  surface  structures  will  have 
similar  underlying  base  structures.  The  rules  of  the  transfor¬ 
mational  subcomponent,  acting  on  a  base  tree  and  on  trees  result¬ 
ing  from  the  application  of  earlier  ransformations ,  produce  the 
surface  tree  of  the  sentence.  The  structure  of  the  surface 
(derived)  tree  is  Intended  to  reflect  not  only  the  actually 
occurring  word  order  of  the  sentence,  but  also  psychologically 
significant  groupings  of  words.  Tne  phonological  rules  apply  to 
this  surface  structure  to  produce  a  representation  of  spoken 
language.  Thus,  the  semantic  and  phonological  sets  of  rules  are 
purely  interpretive  on,  respectively,  the  base  and  derived  struc¬ 
tures  of  a  sentence. 

There  were  two  outstanding  contributions  in  the  area  of  syntax 
last  year,  the  first  by  Lakoff  (70),  the  second  by  Fillmore  (26). 
Lakoff  investigated  certain  cases  of  syntactic  irregularity  and 
suggested  that  lexical  items  should  have  rule-exception  features 
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in  addition  to  the  other  syntactic  features  already  proposed  in 
Aspects  (16).  This  innovation  makes  possible  the  exclusion  from 
(or  inclusion  in)  the  domain  cf  a  transformation  some  specific 
lexical  items  that  are  not  otherwise  distinguished  as  a  group. 

Fvr  examp _e,  the  passive  transformation  applies  to  all  sentences 
having  a  verb  followed  by  a  direct  object  noun  phrase  (John  hit 
the  ball;  the  ball  was  hit  by  John)  with  the  exception  of  sen¬ 
tences  containing  a  few  verbs  like  "marry"  and  "befit."  By 
positing  the  existence  of  a  rule  feature  [+  passive]  to  be  asso¬ 
ciated  with  each  verb,  the  passive  transformation  is  now  redefined 
to  be  applicable  to  only  those  sentences  containing  verbs  speci¬ 
fied  as  [+  passive].  The  implication  of  this  suggestion  is  to 
make  the  base  structure  of  the  grammar  far  more  abstract  than 
previously.  For  example,  just  as  the  verb  "act"  will  be  speci¬ 
fied  as  [+  nominalization]  in  order  to  permit  a  nominalization 
transformation  to  derive  the  noun  "action"  from  the  verbal  form 
"act",  so  will  there  be  posited  a  verb  "idea"  similarly  specified 
such  that  the  nominalization  transformation  obligatorily  derives 
the  noun  "idea."  We  also  point  out  here  that  this  same  piece  of 
work  by  Lakoff  contains  some  new  and  certainly  suggestive 
analyses  of  parts  of  English  syntax,  independent  of  the  sugges¬ 
tions  of  rule  features,  such  as  treating  prepositions,  adjectives, 
and  verbs  in  a  relatively  uniform  manner. 

The  second  paper,  by  Fillmore,  is  an  investigation  of  the  basic 
form  of  the  base  structure  of  a  transformational  grammar.  He 
suggests  that  Chomsky's  emphasis  on  underlying  subject  and  object 
is  an  unnecessary  one  and  will  not  play  a  significant  role  in  the 
semantic  rules  that  interpret  a  base  structure.  He  argues  that 
the  relationship  between  the  lexical  items  "door"  and  "close"  in 
the  sentence  "The  door  closed"  and  "John  closed  the  door"  is 
exactly  the  same  and  that  there  is  no  way  of  expressing  this  fact 
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in  the  base  structure  formalism  presented  in  Aspects . 

Accordingly,  he  suggests  a  revision  of  the  base  structures  of 
sentences,  such  that  every  sentence  has  an  analysis  as:  possibly 
certain  adverbials,  an  auxiliary,  and  a  proposition.  The  pro¬ 
position  contains  the  verb  and  various  types  of  phrases  such  as 
the  ergative,  .gentive,  instrumental,  etc.  One  of  these  phrases, 
dominated  by  the  proposition,  eventually  becomes  the  grammatical 
subject  of  the  sentence,  subject  to  very  rigorous  constraints; 
which  are  only  sketched  out  in  his  paper.  Fillmore’s  paper  is 
certainly  provocative  but  much  further  development  of  the  ideas 
is  needed  before  their  linguistic  merit  can  be  evaluated. 

Various  other  papers  appeared  in  which  small  parts  of  English 
syntax  were  considered  within  the  transformational  framework, 
among  them  papers  by  Fraser  (31),  Peters  (50h),  Lakoff  (70), 

Ross  (50d),  Rosenbaum  &  Lochak  (95),  Postal  (90),  Schane  (101), 
and  Langendoen  (72).  All  oi  these  were  written  from  the 
linguist's  point  of  view.  Host  of  them  presuppose  at  least  some 
knowledge  of  linguistic  theory  and  were  not  written  with  a  view 
towards  computer  .-implementation  of  the  rules  and  regularities 
that  they  propose. 

There  were  two  notable  efforts  reported  in  the  literature  during 
the  last  year  for  which  the  goal  was  to  compile  a  grammar  of 
English  that  accounts  for  a  sizable  subset  of  the  language.  The 
first,  by  Chapin  (6),  and  Geiss  (116),  represents  the  current 
version  of  the  grammatical  rules  developed  for  the  MITRE  Analysis 
Procedure  for  Transformational  Grammars  and  presents  a  set  of 
sample  sentences  processed  by  that  analysis  procedure.  These 
rules  represent  a  revised  version  of  the  grammar  to  be  found  in 
The  English  Preprocessor  Manual  (8^)  and  consist  of  corrections, 
refinements,  and  the  introduction  of  a  few  new  constructions. 


-7- 


“r*  m 

I  J 


The  second  paper  is  that  by  Rosenbaum  &  Lochak  (9b)  >  which 

contains  both  an  introduction  to  some  01  the  linguistic  theory 

underlying  its  grammar,  and  the  conventions  utilized,  and  a  set  „ 

of  rules  for  a  part  of  English  syntax  with  an  emphasis  on  the 

predicate  complement  construction.  This  grammar,  which  does  not 

make  use  of  syntactic  features,  will  shortly  be  replaced  by  a 

more  sophisticated  version  that  uses  features  and  also ‘reflects 

a  number  of  innovations  in  linguistic  theory. 

•  I  J  | 

A  third  effort,  currently  underway  at  the  University  of 

California  at  Los  Angeles  under  the  direction  of  Prof.  Robert  .» 

Stockwel 1,  is  attempting  to  construct  a  consistent  transforma-  _ 

tional  grammar  of  English  syntac  that  reflects  all  work  performed  __ 

to  date.  This  project  is  about  half  way  through  its  two  and  one- 
half  year  term  and  no  publications  can  be  exf  ected  until  at 

*  v 

least  the  Pall  of  19^7 . 
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Last  year,  Simmons'  review  of  semantic  theory  suggested  that 
there  now  exist  several  "major  fragments"  of  a  comprehensive 
theory  of  semantics.  He  mentioned  specifically  the  work  of  Katz, 
Fodor  and  Postal,  of  the  Cambridge  Language  Research  Unit,  and  of 
Quillian,  and  implied  that  these  separate  fragments  might  even¬ 
tually  come  together  to  make  some  harmonious  semantic  theory. 
Perhaps  this  is  true,  but  no  such  coalescence  has  aj.  eared  this 
year,  and,  in  fact,  a  good  percentage  of  the  year's  publications 
in  this  field  seem  to  have  amounted  to  attacks,  open  or  implicit, 
by  one  semantic  theorist  on  another,  especially  attacks  on  and  by 
the  Katz  group.  Berlyne  (3),  Osgood  (86),  and  Bolinger  (8)  have 
all  published  papers  written  specifically  as  critical  attacks  on 
the  Katz  position.  Two  other  statements  of  approaches  to  seman¬ 
tics  that  appeared  during  the  year  (Weinreich  (117),  and  Quillian 
(91),)  bc'.h  went  to  considerable  lengths  to  show  why  they  thought 
Katz's  position  was  wrong,  if  not,  indeed,  absurd.  Katz  (60), 
meanwhile,  has  replied  vociferously  to  at  least  some  of  his 
detractors . 

We  will  not  recount  here  the  multifarious  disagreements  involved 
in  all  this,  much  less  take  sides.  However,  we  would  like  to 
propose  one  sort  of  "dimension,"  along  which  it  seems  many  seman¬ 
tic  theories  may  be  located.  Essentially,  this  dimension  is  the 
degree  of  complexity  of  the  material  that  is  assumed  to  constitute 
semantic  information.  For  a  performance  model,  this  information 
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is  what  wouxd  have  to  be  stored  in  the  memory  of  the  device  that 
produced  and/or  comprehended  language.  A  position  lying  near 
the  bomplexlty"  end  of  this  dimension  would  assume  that  semantic 
information  consists  of  complex  configurations,  forming  overall 
a  network  of  nodes,  interrelated  to  one  another  by  different 
kinas  of  links.  A  position  closer  to  the  middle  of  this  contin¬ 
uum  might  assume  that  semantic  information  is  structured  into 
trees,  but  still  trees  using  several  different  kinds  of  labeled 
linkages.  Further  along  on  this  continuum  one  might  find  a 
theory  that  assumed  a  tree  structure  for  semantic  information 
but  now  one  that  used  only  a  single  kind  of  linkage  between  nodes 
of  the  tree.  Finally,  a  position  near  the  "simplicity’'  end  of 
our  continuum  might  assume  thav  semantic  information  consists 
simply  of  unordered  aggregates  of  semantic  features.  Semantic 
theories  of  today  seem  to  be  spread  all  along  this  continuum. 

The  semantic  theory  of  Katz  (61,60)  and  his  co-authors,  Fodor 
(29)  ana  Postal  (hereafter  KFP)  lies  very  close  to  the  simplicity 
end  of  this  continuum,  although,  as  Weinreich  points  out,  the 
KFP  theory  really  seems  to  occupy  two  different  positions  at  once. 
Both  of  these  positions  are  near  the  simplicity  end  of  cur  con¬ 
tinuum.  First,  KFP  assumes  that  the  semantic  information  to  be  pre¬ 
stored  in  the  lexicon  is  all  to  be  in  the  form  of  very  simple 
trees  with  unlabeled  links,  with  essentially  two  kinds  of  labeled 
nodes  (markers  and  distinguishers ) .  Second,  however,  KFP  says 
that  the  final  result  of  the  semantic  component  of  a  language  is 
to  be  an  even  simpler  sort  of  information,  viz,  a  single  bundle 
of  unordered  properties  for  each  sentence. 

Weinreich  feels  it  would  be  desirable  to  be  consistent  about  the 
complexity  of  semantic  structures  (p.  419): 
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Another  way  of  saying  it  is  that  definitions  of  words 
have  semantic  structures  of  the  same  general  form  as 
sentences  of  a  language....  This  principle  explains 
the  possibility  of  freely  introducing  new  words  into 
a  language  by  stipulating  their  meanings  through  ex¬ 
pressions  couched  in  words  of  language  L.  Of  course, 
a  particular  complex  expression,  if  it  is  not  tautologi- 
cous,  contains  more  features  in  its  semantic  structure 
than  any  one  of  its  constituents;  but  the  form  of  the 
structures,  as  represented  in  the  theory,  is  the  same 
for  simplexes  and  complexes. 

Weinreich  points  out  that  while  a  few  constructions,  e.g.,  simple 
adjective-noun  constructions  can  indeed  be  viewed  as  forming 
simple,  unordered  sets  of  semantic  features  (a  process  he  calls 
"linking"),  a  great  many  others  cannot,  and  he  proposes  other 
sorts  of  combination  procedures  --"nesting,"  "delimiting," 
"modalization"  —  to  produce  more  complex  configurations.  Thus, 
Weinreich' s  own  position  on  our  continuum  lies  somewhat  further 
out  toward  the  complexity  end  than  does  KFP,  since  Weinreich 
would  allow  for  semantic  information  of  somewhat  more  complex  form. 
However,  Weinreich  would  still  like  to  treat  as  many  syntactic 
constructions  as  possible  as  forming  simple  unordered  sets  of 
features.  Weinreich  also  discusses  a  number  of  other  issues  in 
his  paper,  and  argues  that  his  view  of  semantics  is  much  more 
compatible  with  transformational  grammars  than  is  KFP.  However, 
on  the  issue  of  how  semantic  information  is  to  be  structured, 
Weinreich  only  proposes  notations  to  represent  nonlinking  con¬ 
figurations  when  this  notation  can  be  taken  readily  from  symbolic 
logic  or  transformational  theory.  As  a  consequence,  he  still 
thinks  of  semantic  information  as  arranged  into  structures  that 
are  restricted  to  trees,  although  he  suggests  at  several  points 
that  this  may  be  inadequate. 

Another  paper  with  assumptions  that  place  it  close  to  the  simplic¬ 
ity  end  of  our  continuum  is  that  of  Kiefer  (65),  who  says  (p.228): 
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...It  seems  obvious  that  the  semantic  relations 
involve  a  hierarchic  structure  of  semantic  cate¬ 
gories,  therefore  the  semantic  relations  are  defined 
with  such  a  system  of  semantic  categories  taken  for 
granted. 

Kiefer  goes  on  to  explore  the  implication!)  that  would  follow  if 
there  were,  for  any  one  language,  a  single'  hierarchy  into  which 
all  semantic  categories  could  be  placed.  He  finds  that  one  could 
then  define  different  kinds  and  degrees  of  similarity  between 
word  meanings,  depending  upon  the  number  and  remoteness  of  what 
amounts  to  the  superset  memberships  they  share.  On  the  other 
hand,  Kiefer  does  say  (p.  226-9) 


...The  deeper  we  penetrate  into  the  semantics  of  natural 
language  the  more  structured  the  set  of  semantic  cate¬ 
gories  seems  to  be.  To  put  it  differently,  the  more 
facts  about  language  we  want  to  describe  by  means  of 
semantic  categories,  the  more  we  have  to  impose  on  Kg 
(in  the  semantic  memory  store)  a  complicated  structure. 

...So  far,  we  do  not  know  how  comp]icated  the  structure 
of  Kg  really  is. 

The  position  of  the  C.L.R.U.  group  (110)  on  our  spectrum  seems  to 
lie  further  toward  the  complexity  end  than  any  of  those  yet  con¬ 
sidered,  since  the  semantic  structures  they  assume  are  nets  of  re¬ 
lations,  without  a  tree  structure  necessarily  imposed,  or  any 
limitation  that  semantic  elements  all  be  single  binary  features, 
such  as  +_animate.  However,  the  C.L.R.U.  semantic  network  is 
still  simple  in  that  it  does  not  label  the  kind  of  link  connect¬ 
ing  semantic  categories.  The  C.L.R.U. 's  effort  has  been  devoted 
to  the  question  of  how,  on  the  one  hand,  one  might  derive  such 
links  from  empirical  investigation,  or,  on  the  other  hand,  to 
how  one  might  cluster  words  or  concepts  together  into  categories. 
The  C.L.R.U.'s  clustering  techniques  rely  on  counting  the  number 
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of  similarities  found  between  categories,  and  never  on  any  quali¬ 
tative  difference  between  kinds  of  link.  It  should  be  noted  that 
these  research  aims  are  thus  in  contrast  to  the  other  work  re¬ 
ported  here,  which  is  concerned  with  how  semantic  information 
could  be  used  if  one  did  have  it,  rather  than  how  to  obtain  that 
information. 

At  about  the  same  level  of  complexity  as  the  C.L.R.U.  concepts, 
one  finds  the  networks  proposed  as  models  of  memory  in  certain 
psychological  simulation  models,  for  example,  by  Prijda  (35)  and 
also  in  the  "EPAM"  models.  These  later  models,  which  use  dis¬ 
crimination  nets,  are  relatively  quite  well  understood  and  have 
properties  which  many  people  working  in  language,  especially 
linguists  seem  not  to  be  aware  of.  Wynn  (119)  has  given  a  par¬ 
ticularly  thorough  and  insightful  Investigation  of  EPAM-type 
models.  It  seems  to  us  that  a  study  of  such  graphs  could  add  a 
great  deal  to  the  linguist's  knowledge  of  what  sorts  of  structures 
for  semantic  information  are  available  to  be  used  in  conjunction 
with  their  syntactic  models.  In  this  regard,  the  works  of  Reich 
(92),  although  incomplete  and  somewhat  chaotic,  are  at  least  a 
first  attempt  to  forge  a  merger  between  linguistic  problems  and 
an  appreciation  of  what  can  be  accomplished  with  a  sophisticated 
network  model. 

An  interesting  exploration  of  general  properties  of  graphs  was 
reported  during  the  year  by  Mann  &  Jensen  (79).  These  authors 
describe  techniques  allowing  rapid,  guided  searching  of  fairly 
complex  networks,  although  their  techniques  are  very  much  limited 
by  being  applicable  only  to  the  finding  of  a  path  between  two 
given  nodes  that,  although  perhaps  long,  still  always  has  all  of 
its  links  running  in  one  direction.  Two-way  paths,  i.  e.,  ones 
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made  up  of  two  separate  subpaths  leading  from  each  of  the  given 
nodes  to  some  third  common  node.  The  importance  of  allowing 
this  kind  of  path  through  a  common  ingredient  has  been  demonstrated 
by  Quillian’s  program  (91). 

Finally,  at  probably  the  farthest  extreme  of  the  continuum  tnat 
we  have  proposed  lies  Quillian's  model.  This  represents  semantic 
information  by  general,  recursive  graph  structures  (which  unlike 
tree  structures  also  allow  loops  within  the  data  network)  and  by 
complex  configurations  built  up  with  qualitatively  differing 
kinus  of  links.  Quillian  argues  that  the  most  important  point 
about  such  a  network  is  that  it  can  be  used  by  a  computer  to  ans¬ 
wer  a  great  many  more  semantic  questions  than  were  anticipated 
at  the  time  the  information  was  stored,  because  such  information 
is  rich  enough  to  allow  its  use  in  reorganized  form  without  be¬ 
coming  nonsense. 

Simmons  (106,  also,  is  now  working  with  a  network  model  similar 
to  that  advocated  by  Quildan  and  is  addressing  himself  in  particu¬ 
lar  to  the  question  of  how  inferences  within  data  stored  in  such 
a  network  may  be  made.  Simmons'  work  is  discussed  further  in 
Section  V.  The  inference  problem  in  such  networks  reduces  to 
the  question  of  when  one  path  connecting  two  nodes  in  the  data 
store  can  be  rewritten  and/or  collapsed  into  another  shorter  path. 
If  a  connective  is  rewritten,  some  paraphrase  of  the  original 
relationship  is  generated.  If  a  long  path  is  collapsed  into  a 
shorter  one,  an  Inference  has,  in  effect,  been  drawn.  Two  worKs 
relevant  to  these  processes  have  appeared  recently.  The  first  is 
a  dissertation  by  Elliot  (23)  in  which  he  attempts  to  describe 
prepositions  by  pronerties  relevant  to  such  inference  making. 

To  describe  these  prepositions,  Elliot  includes  standard 
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properties  sucn  as  transitivity,  reflexivity,  etc.,  is  well  as 
some  properties  less  often  considered  in  this  context,  such  as 
"one-leaaer . "  If  prepositions  or  something  like  them  are  to 
appear  as  connectives  in  a  network  of  semantic  data,  then  such 
properties  are  essential  to  drawing  inferences  from  that  data. 

The  second  interesting  work  in  regard  to  inference  making  is  the 
paper  by  Fillmore,  mentioned  earlier  in  the  section  on  syntactic 
theory.  Fillmore  is  specifically  concerned  about  how  best  to 
formulate  parts  of  a  transformational  grammar,  but  his  work  is 
relevant  to  the  type  of  connectives  that  should  be  used  in  a 
store  of  semantic  data.  His  point,  essentially,  is  that  to 
specify  that  some  noun  is,  say,  the  subject  of  some  verb,  is  an 
inadequate  specification  of  their  relationship  .  This  inadequacy 
shows  up  in  a  network  model  when  the  verb  has  to  be  rewritten  to 
derive  or  recognize  a  paraphrase.  Thus,  the  verb  "to  swarm"  can 
be  rewritten  as  "to  cluster,"  if  its  subject  is,  say,  "bees." 
However,  if  the  subject  of  swam  is,  say,  "garden,"  as  in  "the 
garden  swarms  with  bees,"  then  rewriting  the  verb  as  "clusters" 
produces  an  erroneous  paraphrase,  unless  the  verb's  subject  is 
also  changed.  This  example  shows  that,  within  a  store  of  seman¬ 
tic  information,  a  link  labeled  "subject  of"  is  inadequately 
differentiated.  See  Quiliian  (91)  for  more  discussion  of  Fill¬ 
more's  work  from  this  point  of  view. 

In  summary,  it  would  appear  that  there  are  now  two  principal 
techniques  available  for  representing  the  structures  of  semantic 
information  that  is  to  be  associated  with  language.  One  of  these 
techniques  is  to  view  this  information  as  generated  by  some  set 
of  recursive  rules,  such  as  a  grammar  or  a  computer  program.  The 
other  is  to  view  this  information  as  some  kind  of  network  of 
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associated  links.  Either  of  these  techniques  may  be  simple,  and 

consequently  bear  low  information,  or  complex,  and  consequently 

rich  in  information.  Skinner's  view  of  language  (108)  was  that 

a  very  simple  associative  net,  and  what  amounted  to  trivially 

simple  rules.,  could  adequately  explain  language.  Chomsky  (16) 

argued  tellingly  against  this  view,  proposing  that  much  more 

% 

complex  rule  systems  were  required.  Some  of  Chomsky's  followers 
have  proposed  semantic  theories  to  be  based  on  syntactic  rule 
systems  and  what  amounts  to  a  store  of  information  structured  so 
minimally  that  it  contains  relatively  little  information.  This 
may,  in  fact,  be  acceptable.  On  the  other  hand,  this  may  put  an 
impossible  load  on  the  system  of  csemantic  rules. 
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SECTION  IV 


PSYCHOLINGUISTIC  STUDIES  RELATED  TO  SEMANTICS 

The  number  of  studies  moro  or  less  relevant  to  semantics  has 
continued  to  increase  during  the  year.  A  good  number  of  these 
have  been  psychologically  oriented  and  connected  in  one  way  or 
another  with  the  Cognitive  Studies  Center  at  Harvard.  Early  work, 
such  as  that  by  Miller  (82),  appeared  to  support  the  hypothesis 
that  people  really  operate  with  something  functionally  similar  to 
a  transformational  grammar  in  their  heads.  Some  of  the  strongest 
apparently  supportive  data  for  this  hypothesis  is  to  be  found  in 
a  study  by  Savin  (49),  which  appears  to  show  that  the  transfor¬ 
mational  complexity  of  a  sentence  directly  predicts  the  amount  of 
short-term  memory  space  a  subject  has  left,  after  reading  the 
sentence,  to  use  for  storing  a  list  of  nonsense  syllables.  Other 
supportive  evidence  was  reported  by  Fodor  (28),  who  found  that 
subject  tended  to  perceive  a  click  superimposed  on  an  aurally 
presented  sentence,  not  at  tne  place  where  the  click  was  super¬ 
imposed,  but  instead  slightly  displaced  forward  t.r  backward  in 
the  sentence;  moreover  this  displacement  could  be  predicted  by 
the  syntactic  constituent  structure  of  the  sentence.  However, 
followup  studies  often  seem  to  have  been  discouraging  for  those 
who  might  like  to  think  that  transformations  have  any  simple  or 
direct  psychological  validity.  The  conflicting  evidence  on  both 
sides  is  too  tentative  and  diverse  to  attempt  to  synthesize  here, 
but  many  of  the  psychological  studies  involved  are  summarized  in 
the  Harvard  Cognitive  Center  reports  (48)  and  in  the  survey  by 
Ervin-Tripp  &  Slob in  (24). 


An  .her  major  source  of  information  on  such  studies  is  the 
Juu  1  of  Verbal  Learning  and  Verbal  Behavior.  The  interested 
reader  will  also  want  to  look  into  the  survey  by  Creelmann  (19), 
which  is  a  very  thorough  examination  of  "semantic  generalization" 
studies,  stemming  from  Razran's  original  work.  Another  recent 
study,  done  by  Kohlers  (66),  would  appear  to  be  very  damaging  to 
the  hypothesis  that  a  single  grammar  must  appear  as  a  "component 
of  any  reasonable  i  rfOx^mance  model"  (unless,  of  course,  one 
denies  that  a  human  is  a  reasonable  performance  model!).  Kohlers 
presented  bilingual  subjects  with  sentences  primarily  of  one 
language,  but  with  a  few  words  of  those  sentences  translated  into 
the  second  language  that  the  subject  knew.  He  found  that  these 
subjects  could  read  such  texts  silently  at  almost  the  same  speed 
they  could  read  single-language  text,  but  if  asked  to  read  aloud, 
their  time  increased  very  significantly.  Kohlers  feels  this 
result  cannot  be  wholly  explained  simply  by  increased  motor 
difficulty  involved  in  switching  the  position  of  the  tongue, 
throat,  etc.,  when  speaking  different  languages,  so  that,  it 
would  seem,  some  really  different  information  processing  must 
occur  in  production  of  sentences  as  distinct  from  comprehension 
of  sentences. 

Another  psychological  study  relevant  to  semantics  is  reported  by 
Paige  &  Simon  (87),  who  in  part  were  attempting  to  find  out 
whether  or  not  people  solving  algebra  word  problems  think  in  a 
manner  similar  to  the  way  that  Bcbrow’s  computer  program, 

STUDENT  (4),  worked.  On  this  question,  the  answer  seems  to  be 
a  mixed  yes  and  no,  depending  upon  the  particular  person,  on  the 
instructions  given  him,  and  so  on.  One  interesting  finding  resulted 
from  giving  the  subjects  problems  that,  in  fact,  violated  physical 
reality  (such  as  requiring  for  solution  that  a  board  be  of  nega¬ 
tive  length).  Some  subjects  simply  mispcrceived  such  problems  so 


that  they  did  make  sense,  thus  illustrating  the  pervasive 
influence  of  stored  knowledge  of  the  task  on  sentence  comprehen¬ 
sion. 

Two  studies  dealing  with  the  disambiguation  of  polysemantic 
words  occurred  during  the  year,  one  primarily  an  analytic  study, 
by  Rubinstein  (97),  the  other,  a  computer  simulation  by  Quillian 
(9-1-).  In  the  latter  study  it  was  found,  somewhat  to  the  investi¬ 
gator's  surprise,  that  a  computer  program,  working  with  almost  no 
syntactic  information  whatsoever,  was  able  to  correctly  disambi¬ 
guate  12  out  of  19  ambiguous  words  in  5  sentences  of  running  text. 
The  program  utilized  semantic  information  stored  in  network  form. 

Another  interesting  empirical  study  is  reported  by  MacKay  (78). 

In  his  study,  subjects  were  asked  to  complete  ambiguous  sentences. 
Judging  the  difficulty  of  the  task  by  the  time  necessary  for  the 
subject  to  complete  the  sentences,  MacKay  found  that  he  could 
distinguish  between  ambiguities  at  three  levels:  lexical,  derived 
(surface)  structure,  and  deep  structure.  This  supports  the 
psychological  significance  of  Chomsky's  formulation  of  a  grammar. 
In  addition,  the  fact  that  very  few  subjects  even  noted  the  am¬ 
biguities  indicates  perhaps  that  analysis  and  understanding  are 
usually  concerned  with  mapping  input  onto  some  one  consistent, 
previously  viable,  interpretation  and  not  with  a  search  for  all 
possible  meanings.  MacKay  concludes  with  the  observation  (p.^26): 

An  attempt  to  fit  these  results  to  several  theories  of 
the  processing  of  ambiguous  sentences  led  us  to  the  con¬ 
clusion  that  ambiguity  Interferes  with  our  understanding 
of  a  single  meaning  of  a  sentence,  and  that  the  degree 
of  interference  varies  with  the  linguistic  level  at  which 
the  ambiguity  occurs. 
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Another  type  of  study  pertinent  to  semantic  and  syntactic  per¬ 
formance  models  is  that  which  studies  the  acquisition  of  language 
in  children.  Earlier  work  in  this  area  is  summarized  in  Brown 
(11)  and  in  Ervin-Trip  &  Slobin  (24). 

Another  study  that  promises  to  be  of  great  interest  to  semantic 
theory  is  being  undertaken  by  John  Olney*  at  System  Development 
Corp.  Olney  is  keypunching  the  entire  Webster's  Seventh  Collegi¬ 
ate  Dictionary  and  will  try  to  answer  basic  questions  in  lexico¬ 
graphy,  such  as  which  words  and  which  kinds  of  relationships  are 
used  most  often  in  the  definitions  of  other  words.  This  is, 
therefore,  an  empirical  investigation  of  the  semantic  network  in¬ 
herent  in  an  actual  dictionary,  which  should  prove  of  considerable 
interest . 
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SECTION  V 


COMPUTER  SYSTEMS 

Computer  systems  lie  on  a  continuum  between  two  extremes.  At 
one  end  we  have  systems  which  manipulate  language  for  its  own  sake, 
providing  linguists  a  tool  to  better  study  the  English  language 
(and  other  languages).  On  the  other  extreme  we  have  the  attempts 
to  make  useful  systems  (or  at  least  demonstration  systems)  which 
accept  natural  language  questions  as  input  and  then  utilize 
some  data  base  to  answer  the  questions.  Of  course  in  some  of 
these  question-answering  systems  part  of  the  aim  is  to  develop 
a  performance  theory  of  language;  and  similarly  some  of  the 
theoretical  tools  are  also  thought  of  as  being  steps  toward 

systems  which  will  ultimately  be  able  to  communicate  in  natural 
language . 

A  large  project  at  IBM  (80  )  is  concerned  with  the  specification  and 
utilization  of  a  transformational  grammar.  This  project  is  in¬ 
vestigating  the  applicability  of  transformational  theory  as  a 
theoretical  support  for  linguistics  aspects  of  language  data 
processing.  As  mentioned  earlier,  this  IBM  group  has  been  de¬ 
veloping  a  transformational  grammar  of  English  which  includes 
a  large  lexicon.  In  addition  they  have  been  developing  computer 
programs  to  aid  in  the  grammatical  and  lexicographic  work.  They 
have  implemented  a  LISP  program  to  test  the  generative  power  of 
a  transformational  grammar.  Given  the  deep  structure  of  a 
sentence  (In  a  notation  developed  by  the  group)  a  sequence  of 
transformation  names,  and  a  transformational  grammar,  It  will 
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run  through  the  proposed  derivation  of  the  sentence,  checking 
for  the  applicability  of  the  requested  transformations  and  the 
applicability  of  other  obligatory  transformations  not  requested, 
but  possible.  Thus  this  batch-processing  system  can  help  a 

| 

linguist  to  determine  the  consistency  of  a  grammar  and  the 
language  it  generates.  Systems  similar  in  purpose  to  this  IBM 

£ 

system  are  currently  being  designed  for  on-line  use  at  Stanford, 
by  Joyce  Friedman,  at  System  Development  Corporation,  by  Londe, 
and  at  Bolt  Beranek  and  Newman,  by  Bobrow,  Fraser,  and 
Teitelrnan. 

The  program  at  IBM  for  support  of  lexicon  development  involves 
removing  much  of  the  clerical  work  from  the  developer.  It 
allows  additions,  deletions,  and  modifications  to  the  lexicon, 
partial  or  complete  printout  in  several  convenient  formats,  and 
scans  of  various  sorts  to  assist  the  lexicographer.  This  pro¬ 
gram  is  written  in  SNOBOL  for  the  IBM  7094. 

The  IBM  system  described  above  is  useful  in  helping  a  linguist 
test  a  generative  grammar.  For  analysis  of  English  language 
input  to  a  computer,  however,  syntactic  analysis  programs  are 
necessary.  At  the  MITRE  Corp.,  there  has  been  continued 
development  of  their  transformational  grammar  parsing  system. 
This  system  was  described  in  Simmons  survey  last  year  and  in 
Kuno's  recent  survey  (68).  Differences  between  this  system  and 
a  similar  transformation  parsing  system  developed  by  Petrick 
(89)  were  also  described.  We  point  out  here  that  only  Petrick's 
system  represents  an  algorithm  which  handles  formally  defined 
class  of  transformational  grammar.  The  current  version  is  run¬ 
ning  in  LISP  on  the  UNIVAC  M-460,  and  IBM's  M44,  7094,  7044, 
and  Q-32.  In  this  past  year  a  lexical  lookup  system  has  been 
programmed  for  the  MITRE  system,  as  reported  by  Walker  & 
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Bartlett  (115).  In  addition,  an  efficient  context-free  parsing 
algorithm  for  determining  surface  structure  has  been  programmed. 
This  algorithm  allows  a  compact  representation  of  multiple  trees 
("super-trees")  in  the  same  structure.  In  application  of  the 
reversal  of  the  generative  transformation  rules  in  finding  a 
transformational  (deep  structure)  parsing,  a  technique  has  been 
developed  to  apply  these  transformations  to  these  "super-trees," 
which  represent  simultaneously  a  number  of  phrase  structure  trees. 

Improvement  in  the  MITRE  grammar  continued  to  be  made  and  several 
versions  of  the  MITRE  grammar  have  been  issued.  The  most  impor¬ 
tant  change  that  has  been  made  since  the  most  recent  version 
appeared  is  a  new  treatment  of  coordinate  conjunction  by  Schane 
(111).  Other  work  in  progress  deals  with  lexical  analysis  and 
the  incorporation  into  the  grammar  of  syntactic  features. 

Some  interesting  investigations  of  the  properties  of  the  MITRE 
syntactic  analysis  procedure  have  been  carried  out  by  Friedman. 

In  particular,  Friedman  (33)  shows  that  an  algorithm  exists  for 
constructing  the  surface  grammar  in  the  form  of  the  grammar  if 
a  weak  condition  on  surface  structure  is  met,  namely,  that  no 
symbol  has  two  nonadjacent  occurrences  in  a  branon  without  an 
intervening  S  node,  where  S  is  the  only  recursive  symbol  in  the 
grammar. 

Elsewhere,  Friedman  (32)  proves  that  there  are  transformational 
grammars  for  which  no  surface  grammar  exists  unless  suitable 
restrictions  are  Imposed  on  the  grammar.  If  this  is  the  case, 
then  algorithms  for  the  construction  of  surface  grammars  do 
exist  but  there  is  no  algorithm  for  a  minimal  surface  grammar. 

A  third  paper  by  Friedman  (3^)  discusses  the  validity  and  genera¬ 
lity  of  the  MITRE  syntactic  analysis  procedure.  The  problem  of 
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the  validity  concerns  the  fact  that  although  the  current  MITRE 
procedure  is  thought  to  be  correct  for  the  specific  grammars  for 
which  it  was  developed,  there  is  no  proof  that  some  cases  have 
not  been  overlooked.  The  generality  problem,  on  the  other  hand, 
concerns  the  difficulties  of  how  to  automatically  generate  the 
additional  components  required  to  make  the  current  procedure 
applicable  to  any  one  of  a  class  of  grammars.  Possible  approaches 
to  the  solution  of  both  these  problems  are  presented. 

Work  on  predictive  analysis  of  English  h3'  been  continued  at 
Harvard  under  the  leadership  of  Susumo  Kuno.  This  system  was 
described  in  last  year's  survey.  It  has  been  put  in  a  broader 
context  in  Kuno's  brilliant  survey  "Computer  Analysis  of 
Natural  Languages"  (66),  in  which  he  pi’esents  a  coherent  overview 
of  the  entire  field.  This  paper  provides  an  excellent  introduc¬ 
tion  for  the  novice  and  a  good  review  for  the  sophisticated 
researcher  in  the  art  of  language  processing.  Recently  Kuno  (67) 
reported  on  an  algorithm  that  will  accept  any  context-free  gram¬ 
mar  and  convert  it  into  a  standard-form  grammar  for  the  predictive 
analyzer.  In  addition,  the  standard-form  grammar  produced  by  the 
analyzer  Is  augmented  in  such  a  way  that  the  trees  produced  by 
the  predictive  parser  provide  information  about  derivations  that 
would  have  been  found  using  the  original  context-free  grammar. 

This  augmented  predictive  analyzer,  as  a  parsing  algorithm  for 
arbitrary  context-free  languages,  is  compared  with  two  other 
parsing  algorithms:  a  selective  top-to-bottom  algorithm  similar 
to  Irons'  (55)  "error-correcting  parse  algorithm,  and  an  immediate 
constituent  analyzer  that  is  an  extension  of  the  Sakai-Cocke 
algorithm  (99)  for  normal  grammars.  Tne  comparison  is  based  on 
several  criteria  of  efficiency,  covering  core  storage  requirements, 
complexity  of  the  programs,  and  processing  time.  Then  comparison 
concluded  that  the  augmented  predictive  analyzer  is  a  parsing 
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algorithm  comparable,  to  if  not  superior,  to  the  other  parsing 
algorithms  in  its  overall  efficiency.  The  selection  of  one 
algorithm  over  the  others  depends  on  individual  application  and  on 
one's  own  taste. 

Some  interesting  wotk  in  predictive  analysis  has  been  coming  from 
the  English  Language  Research  Unit  at  the  University  of*  Edinburgh. 
In  their  paper,  "A  Model  for  the  Perception  of  Syntactic  Structure, 
Thorne  et_  al  (llj)  propose  a  predictive  analysis  system  that  is 
designed  to  be  a  first  stage  in  a  transformational  parsing  system. 

In  contrast  to  the  Harvard  System,  their  program  is  considered 
to  be  a  model  for  syntactic  perception,  rather  than  an  automatic 
parsing  device.  The  principal  implication  of  this  goal  is  that 
it  is  not  essential  to  have,  at  the  first  stage,  a  routine  that 
looks  up  every  word  in  a  form-class  dictionary,  as  is  done  in  the 
Kuno  system.  Part  of  their  reason  for  rejecting  such  a  dictionary 
lookup  (which  would  operate  on  every  word  in  a  sentence)  is  that, 
in  most  cases,  the  information  obtained  in  this  wa^  increases  the 
complexity  of  the  initial  analysis  procedure  rather  than  simpli¬ 
fying  it.  For  example,  in  English  virtually  every  word  that  can 
be  a  noun  can  also  be  a  verb;  given  that,  a  decision  must  be  made 
as  to  which  of  the  two  roles  a  word  is  playing  in  a  sentence 
under  consideration.  It  therefore  seems  more  economical,  if 
nothing  else,  to  work  out  the  part  of  speech  in  terms  of  fulfill¬ 
ment  of  predictions  based  upon  nonambiguous  items  in  the  same 
sentence.  They  propose  to  have  a  closed-class  dictionary  of 
approximately  2000  items,  and  some  affix  rules  to  give  them  other 
information  concerning  words  in  the  sentence.  The  result  of  their 
predictive  analysis  is  a  rough  bracketing  of  the  sentence,  which 
is  all  they  claim  is  needed  for  a  second-stage  transformational 
analysis.  This  bracketting  will  contain  far  fewer  ambiguities 
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tnan  might  be  obtained  if  the  substructure  of  these  rough  groupings 
were  elaborated.  This  first-stage  predictive  analyzer  is  said  to 
be  working,  and  a  full  transformational  parsing  system  should  be 
working  sometime  in  1967. 

Ernest  von  Glasersfeld  (40)  and  his  group,  who  have  just  moved  to 
Athens,  Georgia  from  Milan,  Italy,  have  a  different  and  interest¬ 
ing  approach  to  the  analysis  of  natural  language.  They  have  been 
exploring  the  applicability  to  English  of  a  correlational 
grammar,  as  implemented  in  a  multi-store  parsing  procedure, 
similar  to  the  Sakai-Cocke  algorithm  mentioned  earlier.  In 
their  recent  report,  they  describe  advances  in  several  areas: 
classification  of  words  in  terms  of  correlation  ind.lces,  i.e., 
by  means  of  code  numbers  representing  the  word  possibilities 
of  forming  syntactic  combinations  with  other  words;  reclassifica¬ 
tion,  i.e.,  assigning  correlation  indices  to  word  combinations; 
and  grammatical  and  semantic  factor  analysis,  i.e.,  analysis  of 
grammatical  functions  and  semantic  content  in  terms  of  constant 
factors  across  several  languages.  The  most  outstanding  example 
of  this  latter  work,  by  Brian  Dutton  of  this  group,  is  his 
analysis  (22)  of  factors  underlying  the  use  of  pronouns  in 
languages . 

String  analysis  of  language  derived  from  work  of  Harris  at  the 
University  of  Pennsylvania  (114)  has  been  continued  by  Sager  at 
the  Linguistic  String  Project  at  New  York  University.  This 
group  has  a  number  of  working  parsing  programs  that  produce  an 
interesting  form  of  sentence  analysis.  Their  system  is  basically 
unchanged  from  that  described  in  the  survey  by  Bobrow  (67).  The 
output  of  analysis  is  a  center  string,  which  represents  the 
skeleton  of  a  sentence,  and  various  substrings,  which  are  adjuncts 
to  portions  of  the  sentence  and  modify  various  elements  in  the 
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sentence.  Recently,  they  have  described  a  procedure  for  mapping 
their  analysis  program,  in  which  the  grammar  is  embedded  in  the 
program  itself,  into  a  standard  Backus-normal-form  grammar. 

Bohnert  &  Backer  (7)  have  been  working  on  a  still  different 
approach  to  the  problem  of  representing  the  deep  structure  of 
natural  language.  Their  thesis,  as  with  Darlington  earlier,  is 
that  the  notation  of  symbolic  logic  offers  the  most  promising 
"deep  structure"  notation  for  natural  language  analysis,  and  that 
machine  translation  to  such  notation  is  the  best  proving  ground 
for  this  analysis.  Machine  translation  to  logic,  they  claim,  has 
a  practical  promise  in  that  the  result  can  be  acted  upon  by  logi¬ 
cal  deduction  programs.  Therefore,  if  results  are  stored  in  this 
logical  notation,  information  implicit  in  the  store  can  be  deduced 
by  standard  theorem-proving  techniques.  Their  approach  has  been 
to  begin  with  a  simple  logic-like  language,  which  they  claim  is 
subset  of  English,  and  then  work  step-wise  toward  a  more  fully 
English-like  syntax.  For  each  step  so  far  they  have  written  pro¬ 
grams  for  the  translation  of  the  language  into  logic.  This  is 
basically  an  outgrowth  of  the  Carnap  school  of  semantics.  The 
most  suggestive  part  of  their  work  is  in  exploration  of  certain 
parallels  between  mathematical  logic  and  English;  an  example  is 
factoring,  which  leads  to  their  own  interesting  theory  of  con¬ 
junction. 

A  number  of  papers  on  question-answering  systems  have  appeared 
this  year.  William  Woods  (50  1)  has  written  a  very  interesting 
paper  on  this  subject.  His  paper  proposes  a  uniform  method  for 
performing  semantic  interpretations  of  English  questions  for  a 
question-answering  system  that  has  a  well-structured  dat?i  base. 

In  particular,  the  Official  Airline  Guide  is  considered  as  a 
data  base  for  a  proposed  interpreter  that  takes  the  output 


-27- 


from  a  hypothetical  parsing  algorithm  of  the  Chomsky  transforma¬ 
tional  variety  and  translates  it  to  an  expression  in  a  formal 
query  language.  The  semantic  meaning  of  the  input  Question  is 
represented  in  a  form  similar  to  ordinary  quantiflcational  logic, 
with  an  additional  question  and  set-definitional  capabilJty.  The 
semantic  theory  underlying  the  interpretation  of  a  question  is 
not  the  KFF  theory,  but-  one  similar  to  that  developed  by'  Bobrow 
(4).  Relations  asserted  by  a  sentence  are  determined  by  the  main 
verb  and  its  modifiers,  and  properties  of  the  subject  and  objects. 
Arguments  of  relations  are  specified  by  noun  phrases,  which  may 
denote  specific  objects,  quantified  variables  (with  range  perhaps 
restricted  by  relative  clauses)  question  words,  and  functionally 
defined  objects.  Woods  presents  a  set  of  semantic  rules  in  a 
"pattern-faction"  format  to  specify  a  correspondence  between  usage 
restrictions  and  corresponding  semantic  meanings  of  elements  for 
the  question-answering  system  for  the  Official  Airline  Snide. 

The  major  drawback  of  this  paper  is  that  it  reflects  only  careful 
thought,  but  no  computer  implementation. 

Although  the  material  presented  is  clear,  and  apparently  consis¬ 
tent,  uhere  are  still  a  number  of  fuzzy  areas  (such  as  the 
feasibility  of  retrieval  given  a  specification  of  the  type  pro¬ 
posed),  which  would  only  be  cleared  up  by  an  implementation  of 
such  a  system.  Despite  this,  the  paper  is  well  worth  reading, 
because  of  its  clarity  and  the  number  of  new  ideas  it  presents 
about  treatment  of  language  objects.  It  generalizes  on  the 
approach  taken  by  Thompson  (112)  and  Craig  et_  al  (18)  on  the 
DEACON  system,  which  was  described  last  year,  and  in  Simmons' 
survey  of  question-answering  systems. 

Simmons  (106)  has  continued  work  on  question-answering.  In  his 
latest  approach,  questions  and  statements  are  syntactically 
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analyzed  to  permit  derivation  of  a  normalized  kernel  of  the  form 
subject-verb-nominal.  The  approach  to  kernelization  depends  on 
cues  offered  by  the  surface  structure  that  indicate  a  kernel 
string  should  be  derived.  When  kernels  have  been  derived  for-  a 
question  and  for  a  string  that  may  be  an  answer,  the  two  sets  of 
kernels  are  compared  for  an  identical  match.  If  this  is  not 
found,  the  pair  are  checked  for  a  match  after  certain  meaning- 
preserving  equivalence  operators  have  been  applied  to  the  kernels. 
The  aDproach  is  interesting  but  seems  limited  by  the  ad  hoc  method 
for  the  determination  of  the  kernel  strings.  The  system  also 
faces  problems  in  storing  kernels  in  a  fashion  that  makes  relevant 
ones  more  immediately  accessible  than  irrelevant  in  the  large  data 
base . 

Weizenbaum  (118),  in  a  new  version  of  ELIZA,  a  program  for  con¬ 
versing  in  English,  has  demonstrated  the  strong  effect  of  context 
on  understanding  of  English  input.  He  presents  cogent  arguments 
to  demonstrate  that  this  is  not  an  artifact  of  machines  but  is 
true  for  humans  as  well.  His  program,  as  it  stands,  provides  for 
switching  of  context  to  allow  different  interpretations  of  the 
same  input  as  a  function  of  history  and  frame  of  reference.  In 
two  distinctly  different  contexts,  one  simulating  a  psychiatrist 
of  Rogerian  persuasion,  and  another  simulating  a  mathematical 
assistant,  he  shows  how  the  same  remarks  can  be  interpreted  in 
two  very  different  ways  and  lead  to  two  very  different  types  of 
conversation.  Kis  program  is  designed  to  deal,  not  with  complete 
text,  but  rather  with  fragments  of  natural  language  that  occur  in 
real  conversations.  Therefore,  it  cannot  rely  on  texts  being 
grammatically  complete  or  correct.  (No  theory  that  depends  on 
parsing  of  presumably  well-formed  sentences  can  be  of  much  help 
in  this  task.)  Since  incomplete  utterances  are  characteristic  of 
communication  between  men,  Weizenbaum' s  approach  is  most  suggestive. 
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Qulllian  and  Bobrow  (  5  )  have  two  programs  under  development 
whose  merger  .  it  is  hoped,  will  be  able  to  accept  a  very  wide 
class  of  English  input  and  rep Drt  how  the  assertions  made  are 
related  to  a  prior  data  store.  The  first  program  is  a  predictive 
parser  with  only  a  very  limited  dictionary  of  closed  class 
words.  Its  output  will  be  a  rudimentary  "chunking"  or  bracketing 
of  the  elements  of  the  input  sentence,  rather  than  a  ful? 
characterization  of  its  syntactic  structure.  The  second  program 
is  a  semantic  comparison  program,  which,  for  each  chunk  in  the 
sentence,  will  report  that  the  meaning  of  the  chunk  is: 

1)  Equivalent  in  meaning  to  a  specified  part  of  the 
current  data  store. 

2)  Contradictory  to  the  current  data  store. 

3)  Incomprehensible,  given  the  current  data  store. 

*0  Not  equivalent,  but  sufficiently  similar  to 

information  in  the  store  to  allow  a  tentative 
interpretation  that  can  be  added  to  the  store. 

An  cn-line  "teacher"  will  provide  the  input  sentences  and 
monitor  the  responses  and  interpretation  of  the  program. 

This,  hopefully,  will  allow  graceful  growth  both  of  the 
semantic  store  and  of  the  capabilities  of  the  program,  thus 
increasing  the  breadth  of  "understanding"  of  the  system,  as 
a  function  of  context  and  the  history  of  interaction. 
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CONCLUSION 

No  great  stride  forward  has  been  made  this  year  toward  a  program 
that  can  really  "understand"  a  large  body  of  text.  In  fact, 
there  if.  no  indication  that  a  major  breakthrough  is  just  around 
the  corner.  Much  more  basic  research  seems  necessary  before  one 
can  think  of  a  general  language  processing  system. 

The  emphasis  on  the  science  rather  than  on  the  technology  of 
computational  linguistics  is  particularly  evident  in  the  report 
(85)  of  the  NRC  Automatic  Language  Processing  Advisory  Committee 
(ALPAC).  The  committee  undertook  a  two-year  study  of  the  use  of 
computers  in  the  translation  of  foreign  languages.  It  found 
little  justification  at  present  for  massive  support  of  machine 
translation  per  se,  finding  it  to  be,  overall,  slower,  less 
accurate,  and  more  costly  than  that  provided  by  human  translators. 
The  committee  recommended  that  computers  be  used  to  gain  a  deeper 
understanding  of  the  nature  of  language  and  for  machine-aided 
translation  rather  than  pure  machine  translation.  "Computational 
linguistics — study  of  parsing,  sentence  generation,  structure, 
semantics,  statistics  and  quantative  linguistic  matters,  including 
experiment  in  translation,  with  machine  aids  or  without,  should 
be  supported  as  a  science  and  not  be  judged  by  any  immediate  or 
forseeable  contributions  to  practical  translation."  Judging  by 
this  recommendation,  it  seems  that  the  direction  of  Federal 
support  (and  hence  much  work)  will  swing  more  toward  the 


theoretical  and  basic  research  end  of  the  spectrum  than  toward 
the  engineering  and  practical  approach  at  this  time. 
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